Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

نویسندگان

چکیده

The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient largescale high-quality training datasets. We present two QMDS datasets, which we construct using data augmentation methods: (1) transferring commonly used single-document CNN/Daily Mail summarization dataset to create QMDSCNN dataset, and (2) mining search-query logs QMDSIR dataset. These datasets have complementary properties, i.e., real summaries but queries are simulated, while simulated summaries. To cover both these summary query aspects, build abstractive end-to-end neural network models on combined that yield new state-of-the-art transfer results DUC also introduce hierarchical encoders enable a more efficient encoding together with multiple documents. Empirical demonstrate our methods outperform baseline automatic metrics, as well human evaluations along attributes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RelationListwise for Query-Focused Multi-Document Summarization

Most existing learning to rank based summarization methods only used content relevance of sentences with respect to queries to rank or estimate sentences, while neglecting sentence relationships. In our work, we propose a novel model, RelationListwise, by integrating relation information among all the estimated sentences into listMLE-Top K, a basic listwise learning to rank model, to improve th...

متن کامل

A Query Focused Multi Document Automatic Summarization

The present paper describes the development of a query focused multi-document automatic summarization. A graph is constructed, where the nodes are sentences of the documents and edge scores reflect the correlation measure between the nodes. The system clusters similar texts having related topical features from the graph using edge scores. Next, query dependent weights for each sentence are adde...

متن کامل

Query Focused Abstractive Summarization: Incorporating Query Relevance, Multi-Document Coverage, and Summary Length Constraints into seq2seq Models

Query Focused Summarization (QFS) has been addressed mostly using extractive methods. Such methods, however, produce text which suffers from low coherence. We investigate how abstractive methods can be applied to QFS, to overcome such limitations. Recent developments in neural-attention based sequence-to-sequence models have led to state-of-the-art results on the task of abstractive generic sin...

متن کامل

Ontology and Query-Focused Multi-Document Summarization System

Due to the increasing growth of online information on the specific topic, Multiple Document Summarization (MDS) has become a non-trivial task. The MDS facilitates the user to understand the large volume of information in a short time by creating a concise and comprehensive summary. In addition, user’s query based MDS system provides a consistent summary, including the core of the information. T...

متن کامل

Experiments in Cross Language Query Focused Multi-Document Summarization

The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual information robustly and efficiently, with as high quality performance as possible. Previous research activities on multilingual information access systems have studied cross-language information retrieval (CLIR), information ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i15.17611